Contour regression: A distribution-regularized regression framework for climate modeling

Authors

  • Zubin Abraham
  • Pang-Ning Tan
  • Perdinan
  • Julie Winkler
  • Shiyuan Zhong
  • Malgorzata Liszewska
Abstract

Regression methods are commonly used to learn the mapping from a set of predictor variables to a continuous-valued target variable such that the prediction errors are minimized. However, minimizing the errors alone may not be sufficient for some applications, such as climate modeling, which require the overall predicted distribution to resemble the actual observed distribution. On the other hand, histogram equalization methods, such as quantile mapping, are often used in climate modeling to alter the distribution of input data to fit the distribution of observed data, but they provide no guarantee of accurate predictions. This paper presents a flexible regression framework known as contour regression that simultaneously minimizes the prediction error and removes biases in the predicted distribution. The framework is applicable to linear, nonlinear, and conditional quantile models and can utilize data from heterogeneous sources. We demonstrate the effectiveness of the framework in fitting the daily minimum and maximum temperatures as well as precipitation for 14 climate stations in Michigan. The framework showed marked improvement over standard regression methods in reducing distribution bias. © 2014 Wiley Periodicals, Inc. Statistical Analysis and Data Mining, 2014
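The idea of trading off prediction error against distribution bias can be illustrated with a minimal sketch. It is not the authors' formulation: the objective below (a weighted sum of pointwise squared error and the squared gap between sorted predictions and sorted observations), the weight `gamma`, and the function names `combined_loss` and `fit_linear` are all assumptions chosen for illustration, and the fitting routine is a simple alternating heuristic for a linear model.

```python
import numpy as np


def combined_loss(w, X, y, gamma):
    """Hypothetical objective: gamma * pointwise MSE
    + (1 - gamma) * mean squared gap between sorted predictions and sorted observations."""
    pred = X @ w
    pointwise = np.mean((y - pred) ** 2)
    distribution = np.mean((np.sort(y) - np.sort(pred)) ** 2)
    return gamma * pointwise + (1.0 - gamma) * distribution


def fit_linear(X, y, gamma=0.5, n_iter=20):
    """Alternating heuristic (an assumption, not the paper's solver).

    1. Fix the rank order of the current predictions and pair each sample
       with the observed value of the same rank.
    2. Solve the resulting ordinary least-squares problem, then repeat.
    """
    y_sorted = np.sort(y)
    # Start from the ordinary least-squares solution.
    w = np.linalg.lstsq(X, y, rcond=None)[0]
    for _ in range(n_iter):
        # Rank of each prediction among all predictions (0 = smallest).
        ranks = np.argsort(np.argsort(X @ w))
        # Observed value that has the same rank as each prediction.
        z = y_sorted[ranks]
        # With the pairing fixed, the weighted objective reduces to least
        # squares against a blend of the true target and the rank-matched target.
        target = gamma * y + (1.0 - gamma) * z
        w = np.linalg.lstsq(X, target, rcond=None)[0]
    return w
```

With `gamma=1.0` the sketch reduces to ordinary least squares; smaller values place more weight on matching the observed distribution at the cost of pointwise accuracy.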

Similar resources

Distribution Regularized Regression Framework for Climate Modeling

Regression-based approaches are widely used in climate modeling to capture the relationship between a climate variable of interest and a set of predictor variables. These approaches are often designed to minimize the overall prediction errors. However, some climate modeling applications place more emphasis on fitting the distribution properties of the observed data. For example, histogram equalizati...

Modeling Current and Future Potential Distributions of Caspian Pond Turtle (Mauremys caspica) under Climate Change Scenarios

Although turtles are the most threatened taxonomic group within the reptile class, we have a very limited understanding of how turtles respond to climate change. Here, we evaluated the effects of climate changes on the geographical distribution of Caspian pond turtle (Mauremys caspica). We used an ensemble approach by combining six species distribution models including artificial neural network...

Regularized fuzzy clusterwise ridge regression

Fuzzy clusterwise regression has been a useful method for investigating cluster-level heterogeneity of observations based on linear regression. This method integrates fuzzy clustering and ordinary least-squares regression, thereby making it possible to estimate regression coefficients for each cluster and fuzzy cluster memberships of observations simultaneously. In practice, however, fuzzy clusterwise re...

Analysis of Incomplete Climate Data: Estimation of Mean Values and Covariance Matrices and Imputation of Missing Values

Estimating the mean and the covariance matrix of an incomplete dataset and filling in missing values with imputed values is generally a nonlinear problem, which must be solved iteratively. The expectation maximization (EM) algorithm for Gaussian data, an iterative method both for the estimation of mean values and covariance matrices from incomplete datasets and for the imputation of missing val...

Efficient Multiclass Implementations of L1-Regularized Maximum Entropy

This paper discusses the application of L1-regularized maximum entropy modeling or SL1-Max [9] to multiclass categorization problems. A new modification to the SL1-Max fast sequential learning algorithm is proposed to handle conditional distributions. Furthermore, unlike most previous studies, the present research goes beyond a single type of conditional distribution. It describes and compares ...

Journal title:
  • Statistical Analysis and Data Mining

Volume 7, Issue

Pages -

Publication date: 2014